Measures of Clade Confidence Do Not Correlate with Accuracy of Phylogenetic Trees

نویسندگان

  • Barry G. Hall
  • Stephen J. Salipante
چکیده

Metrics of phylogenetic tree reliability, such as parametric bootstrap percentages or Bayesian posterior probabilities, represent internal measures of the topological reproducibility of a phylogenetic tree, while the recently introduced aLRT (approximate likelihood ratio test) assesses the likelihood that a branch exists on a maximum-likelihood tree. Although those values are often equated with phylogenetic tree accuracy, they do not necessarily estimate how well a reconstructed phylogeny represents cladistic relationships that actually exist in nature. The authors have therefore attempted to quantify how well bootstrap percentages, posterior probabilities, and aLRT measures reflect the probability that a deduced phylogenetic clade is present in a known phylogeny. The authors simulated the evolution of bacterial genes of varying lengths under biologically realistic conditions, and reconstructed those known phylogenies using both maximum likelihood and Bayesian methods. Then, they measured how frequently clades in the reconstructed trees exhibiting particular bootstrap percentages, aLRT values, or posterior probabilities were found in the true trees. The authors have observed that none of these values correlate with the probability that a given clade is present in the known phylogeny. The major conclusion is that none of the measures provide any information about the likelihood that an individual clade actually exists. It is also found that the mean of all clade support values on a tree closely reflects the average proportion of all clades that have been assigned correctly, and is thus a good representation of the overall accuracy of a phylogenetic tree.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Retraction: Measures of Clade Confidence Do Not Correlate with Accuracy of Phylogenetic Trees

As a result of a bug in the Perl script used to compare estimated trees with true trees, the clade confidence measures were sometimes associated with the incorrect clades. The error was detected by the sharp eye of Professor Sarah P. Otto of the University of British Columbia. She noticed a discrepancy between the example tree in Figure 1B and the results reported for the gene nuoK in Table 1, ...

متن کامل

Phylogenetic relationships of the commercial marine shrimp family Penaeidae from Persian Gulf

Phylogenetic relationships among all described species (total of 5 taxa) of the shrimp genus Penaeus, were examined with nucleotide sequence data from portions of mitochondrial gene and cytochrome oxidase subunit I (COI). There are twelve commercial shrimp in the Iranian coastal waters. The reconstruction of the evolution phylogeny of these species is crucial in revealing stock identity that ca...

متن کامل

Phylogenetic relationships of the commercial marine shrimp family Penaeidae from Persian Gulf

Phylogenetic relationships among all described species (total of 5 taxa) of the shrimp genus Penaeus, were examined with nucleotide sequence data from portions of mitochondrial gene and cytochrome oxidase subunit I (COI). There are twelve commercial shrimp in the Iranian coastal waters. The reconstruction of the evolution phylogeny of these species is crucial in revealing stock identity that ca...

متن کامل

On the reliability of Bayesian posterior clade probabilities in phylogenetic analysis

This article discusses possible reasons why posterior clade probabilities obtained from Bayesian phylogenetic analyses might be inaccurate. It attempts to list all possible sources of uncertainty and error in Bayesian phylogenetic analysis. The choice of priors on trees has been suggested by several authors as a cause of inaccurate posterior clade probabilities. I argue strongly for using prior...

متن کامل

Molecular Phylogeny of the Genus Lathyrus (Fabaceae-Fabeae) Based on cpDNA matK Sequence in Iran

Background: More than 60 species of the genus Lathyrus are distributed in Southwest Asia. It is the second largest genus of the tribe Fabeae, after Vicia, in the region (and in Iran with 23 species). In the regional Flora (Flora of Turkey, FloraIranicaand flora...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • PLoS Computational Biology

دوره 3  شماره 

صفحات  -

تاریخ انتشار 2007